PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID KHN17120.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family MYB
Protein Properties Length: 1650aa    MW: 180912 Da    PI: 5.9113
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
KHN17120.1genomeTCUHKView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding26.12e-08784825346
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      +WT+eE e +++ ++ +G++ +++Ia+ +  ++t  +c+++++k
       KHN17120.1 784 PWTPEEREVFLEKFAAFGKD-FRKIASFFD-HKTTADCVEFYYK 825
                      8*****************99.*********.***********98 PP

2Myb_DNA-binding32.81.6e-109721011344
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHH CS
  Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrw 44  
                        WT +E   +++av  +G++ +++Iar++g +R+ +qck ++
       KHN17120.1  972 DWTDDEKTAFLRAVSSFGKD-FAKIARCVG-TRSQEQCKVFF 1011
                       5*****************99.*********.********766 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466895.38E-14768828IPR009057Homeodomain-like
PROSITE profilePS5129315.919780831IPR017884SANT domain
SMARTSM007179.4E-9781829IPR001005SANT/Myb domain
PfamPF002494.3E-6783825IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.602.1E-5784825IPR009057Homeodomain-like
PROSITE profilePS5129312.7239681019IPR017884SANT domain
SMARTSM007172.7E-89691017IPR001005SANT/Myb domain
SuperFamilySSF466891.17E-99701019IPR009057Homeodomain-like
PfamPF002499.8E-99721011IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.603.1E-69721011IPR009057Homeodomain-like
CDDcd001671.19E-79731011No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1650 aa     Download sequence    Send to blast
MPPEPLPWDR KDFFKERKHE RSESLGSVAR WRDSSHHRDF NRWGSAEFRR PPGHGKQGGW  60
HLFSEEPGHG YAISRSSSDK MLEDDSRPSI SRGDGKYGRS SRENRGGPFG QRDWRGHSWE  120
PNNGSMNFPR RLQDVNNDQR SVDDALAYSS HPHSDFGNAW DQHHLKDQHD KMGGVNMFGT  180
GPRSDRDNSL GDWKPLKWTR SGSLSSRGSG FSHSSSSRSM GGADSHEVKA ELLPKSVAAN  240
ESHSGEAAAC ATSSVPSEDT TSRKKPRLGW GEGLAKYEKK KVEVPDASAN KEGPVLSTSN  300
TEPCNLLSPS LVDKSPKLLG FSECASPATP SSVACSSSPG MDDKLFGKTA NVDNYASNLT  360
GSPAPVSESH FARFSFNLEK FDIDSLNNLG SSIIELVQSD DPTSLDSGPM RSNSINKLLI  420
WKADISKVLE MTESEIDLLE NELKSLKSES GETCPCPCPV TLGSQMVGSD EKSCEEHVGV  480
SDQVIRPVPL KIVDDPNTEK MPLSTNLHSI HENGKEEDID SPGTATSKFV EPLPLIKAVS  540
CDTRGHDNFS RDLDTVLSTA VKCLVPCTTR KEASVPACVD GNISMELKDS MDILYKTIIS  600
SNKESANRAS EVFDKLWPKD CCKIEKMEAS SDACTHTFIM EKFAERKQFA RFKERVIALK  660
FRALHHLWKE DMRLLSIRKC RPKSHKKNEL SVRSTCNGIQ KNRSSIRSRF PFPGNQLSLV  720
STSEIINFTS KLLSESQVKV QRNTLKMPAL ILDEKEKMIS KFVSSNGLVE DPLAIEKERA  780
MINPWTPEER EVFLEKFAAF GKDFRKIASF FDHKTTADCV EFYYKNHKSD CFEKIKKQDG  840
DKLGKSYSAK TDLIASGNKK LRAGSSLLGG YGKVKTYRGE DFIEKSSSFD ILGDERETAA  900
AADVLAGICG SLSSEAMSSC ITSSVDPVEG NRDRKFLKVN PLCKLPMTPD VTQDVDDETC  960
SDESCGEMDP TDWTDDEKTA FLRAVSSFGK DFAKIARCVG TRSQEQCKVF FSKGRKCLGL  1020
DLMRPIPENV GSPVNDDANG GESDTDDACV VETGSVVETD KSGTKTDEDL HLYGTNTYHD  1080
ESHPVEARNL SAELNESKEI NWTEVDLEDA NVTSGACQIN IDSKQGCDGS EVFLCGSNKS  1140
GSVGERADII MSDSTEVEND KANKLGGAAT ELISAPNTRE PCQSNSIAED RMVVSEVSSG  1200
GLGNELERHR VSSTLCVDDR DNKHEADSGV IVDMKSSVHD LSTMINSSIS SLGNSCSGLS  1260
FSSENKHVPL GNPHVSALSM DNLHALLQNT VAVDVQCEKT ASQDQMSSTC DIRGGRDMHC  1320
QNSISNGDHQ HITGNLSDHV DAVSILQGYP LQVPVKKEMD SDMNCTSSAT ELPLLPQKIE  1380
HDDDHIKAFQ SSDSDKTFRN GDVKLFGKIL TNPSTTQKPN VGAKGSEENG THHPKLSSKS  1440
SNPKITGHHS ADGNLKILKF DHNDYVGLEN VPMRSYGYWD GNRIQTGLST LPDSAILLAK  1500
YPAAFSNYLT SSAKLEQPSL QTYSKNNERL LNGASTFTTR DINGSNALID YQMFRRDGPK  1560
VQPFMVDVKH CQDVFSEMQR RNGFEAISSL QQQSRGMNGV GRPGILVGGS CSGVSDPVAA  1620
IKMHYSNSDK YGGQTGSIAR EDESWGGKGD
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_C1e-16742833494NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D1e-16742833494NUCLEAR RECEPTOR COREPRESSOR 2
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006589438.10.0PREDICTED: uncharacterized protein LOC100806246 isoform X5
TrEMBLA0A0B2Q7890.0A0A0B2Q789_GLYSO; Nuclear receptor corepressor 1
STRINGGLYMA10G35671.10.0(Glycine max)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF49863352
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.11e-170MYB family protein
Publications ? help Back to Top
  1. Qi X, et al.
    Identification of a novel salt tolerance gene in wild soybean by whole-genome sequencing.
    Nat Commun, 2014. 5: p. 4340
    [PMID:25004933]